Human dorsal striatal activity during choice discriminates reinforcement learning behavior from the gambler's fallacy.

Authors

  • Ryan K Jessup
  • John P O'Doherty
Abstract

Reinforcement learning theory has generated substantial interest in neurobiology, particularly because of the resemblance between phasic dopamine and reward prediction errors. Actor-critic theories have been adapted to account for the functions of the striatum, with parts of the dorsal striatum equated to the actor. Here, we specifically test whether the human dorsal striatum--as predicted by an actor-critic instantiation--is used on a trial-to-trial basis at the time of choice to choose in accordance with reinforcement learning theory, as opposed to a competing strategy: the gambler's fallacy. Using a partial-brain functional magnetic resonance imaging scanning protocol focused on the striatum and other ventral brain areas, we found that the dorsal striatum is more active when choosing consistent with reinforcement learning compared with the competing strategy. Moreover, an overlapping area of dorsal striatum along with the ventral striatum was found to be correlated with reward prediction errors at the time of outcome, as predicted by the actor-critic framework. These findings suggest that the same region of dorsal striatum involved in learning stimulus-response associations may contribute to the control of behavior during choice, thereby using those learned associations. Intriguingly, neither reinforcement learning nor the gambler's fallacy conformed to the optimal choice strategy on the specific decision-making task we used. Thus, the dorsal striatum may contribute to the control of behavior according to reinforcement learning even when the prescriptions of such an algorithm are suboptimal in terms of maximizing future rewards.
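As a rough illustration of the quantities named in the abstract, the following is a minimal sketch of an actor-critic learner on a one-state choice task. It is not the model fitted in the paper: the class name `ActorCritic`, the learning rates, the softmax temperature `beta`, and the helper `label_choice` are all hypothetical. In particular, `label_choice` assumes a simplified reading in which repeating the most recently rewarded option counts as reinforcement-learning-consistent and avoiding it counts as gambler's-fallacy-consistent.

```python
import math


def softmax(prefs, beta=1.0):
    """Turn action preferences into choice probabilities (beta is an assumed temperature)."""
    m = max(prefs)  # subtract the max for numerical stability
    exps = [math.exp(beta * (p - m)) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


class ActorCritic:
    """Hypothetical one-state actor-critic; parameter values are illustrative only."""

    def __init__(self, n_actions, alpha_critic=0.1, alpha_actor=0.1):
        self.value = 0.0                 # critic: expected reward in the single task state
        self.prefs = [0.0] * n_actions   # actor: preference for each option
        self.alpha_critic = alpha_critic
        self.alpha_actor = alpha_actor

    def choice_probabilities(self, beta=1.0):
        return softmax(self.prefs, beta)

    def update(self, action, reward):
        """Apply one outcome and return the reward prediction error."""
        delta = reward - self.value                     # reward prediction error
        self.value += self.alpha_critic * delta         # critic learns the expected reward
        self.prefs[action] += self.alpha_actor * delta  # actor strengthens or weakens the chosen option
        return delta


def label_choice(choice, last_rewarded):
    """Assumed simplification: repeating the last rewarded option is RL-consistent."""
    if choice == last_rewarded:
        return "RL-consistent"
    return "gambler's-fallacy-consistent"


if __name__ == "__main__":
    agent = ActorCritic(n_actions=2)
    for _ in range(3):
        agent.update(action=0, reward=1.0)  # option 0 is rewarded three times in a row
    print(agent.choice_probabilities())
    print(label_choice(choice=0, last_rewarded=0))
    print(label_choice(choice=1, last_rewarded=0))
```

In this sketch the same prediction error drives both the critic (value learning at outcome) and the actor (preferences used at choice), which mirrors the division of labor the abstract attributes to the actor-critic framework.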

Similar resources

Overlapping Prediction Errors in Dorsal Striatum During Instrumental Learning With Juice and Money Reward in the Human Brain

Signe Bray, Shinsuke Shimojo and John P. O'Doherty. Human Medial Orbitofrontal Cortex Is Recruited During Experience of Imagined and Real Rewards. J Neurophysiol, May 2010; 103 (5): 2506-2512. Hackjin Kim, Shinsuke Shimojo and John P. O'Doherty. Overlapping Responses f... Ventromedial Prefrontal Cortex. Cereb. Cortex, August 23, 2010. ...

An fMRI study of risk-taking following wins and losses: implications for the gambler's fallacy.

Human decision-making involving independent events is often biased and affected by prior outcomes. Using a controlled task that allows us to manipulate prior outcomes, the present study examined the effect of prior outcomes on subsequent decisions in a group of young adults. We found that participants were more risk-seeking after losing a gamble (riskloss) than after winning a gamble (riskwin),...

The gambler's fallacy in penalty shootouts

A well-known bias in subjective perceptions of chance is the gambler's fallacy: people typically believe that a streak generated by a series of independent random draws, such as a coin toss, becomes increasingly more likely to break when the streak becomes longer. In a fascinating study, Misirlisoy and Haggard analysed sequential behavior of kickers and goalkeepers in penalty shootouts. They re...

Changes in corticostriatal connectivity during reinforcement learning in humans.

Many computational models assume that reinforcement learning relies on changes in synaptic efficacy between cortical regions representing stimuli and striatal regions involved in response selection, but this assumption has thus far lacked empirical support in humans. We recorded hemodynamic signals with fMRI while participants navigated a virtual maze to find hidden rewards. We fitted a reinfor...

Losses and External Outcomes Interact to Produce the Gambler’s Fallacy

When making serial predictions in a binary decision task, there is a clear tendency to assume that after a series of the same external outcome (e.g., heads in a coin flip), the next outcome will be the opposing one (e.g., tails), even when the outcomes are independent of one another. This so-called "gambler's fallacy" has been replicated robustly. However, what drives gambler's fallacy behavior...

Journal title:
  • The Journal of neuroscience : the official journal of the Society for Neuroscience

Volume 31, Issue 17

Pages: -

Publication date: 2011